Back

American Journal of Epidemiology

54 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Constructing and analyzing a synthetic life course cohort based on pooling two data sources: A case study of early adulthood depression symptomatology and late-life cognition
2026-02-27 epidemiology 10.64898/2026.02.25.26347113
Top 0.1% (5.9%)
Show abstract

BackgroundSynthetic cohorts created by combining two cohorts can be useful when no single data set includes both the exposure and outcome data of interest. We estimate the effects of depression in early adulthood on later-life memory outcome using two nationally representative cohorts separately and in a synthetic sample. MethodsWe used the National Longitudinal Study of Youth 1979 (NLSY; N=5,747) and the Health and Retirement Study (HRS; N=6,846) and a synthetic cohort combining exposure data ...

2
Quantifying bias from reverse causation in observational studies of dementia risk factors: A simulation study informed by age-specific reverse Mendelian Randomization
2026-02-23 epidemiology 10.64898/2026.02.21.26346807
Top 0.1% (5.9%)
Show abstract

BackgroundThe long preclinical phase of dementia can bias estimated effects of baseline exposures on dementia incidence. We demonstrate simulations informed by reverse Mendelian randomization (MR) findings to quantify the age-specific magnitude of reverse causation bias in analyses in observational studies of the effects of body mass index (BMI) on dementia. MethodsWe simulated longitudinal trajectories of BMI and dementia risk from ages 45 to 90 years, calibrating to published evidence on age-...

3
Associations of alcohol use in early and middle adulthood with mid- and late-life cognition - a synthetic cohort approach
2026-03-04 epidemiology 10.64898/2026.02.27.26346914
Top 0.2% (5.7%)
Show abstract

OBJECTIVEUsing two cohorts and synthetic datasets, we estimated effects of prospectively reported alcohol use on memory outcomes across middle age. METHODSData were from National Longitudinal Study of Youth 1979 (NLSY79, n=7540, alcohol reports from ages 18-26), Health and Retirement Study (HRS age 50-56 at enrollment, n=13,090), and a synthetic cohort matching early life exposure information from 3,259 NLSY79 participants to later life memory information from 5,451 HRS participants. Covariate-...

4
Growth mindset and grit as psychological resources in later life: Age, socioeconomic, and health patterning in the English Longitudinal Study of Ageing
2026-03-04 epidemiology 10.64898/2026.02.27.26347198
Top 0.2% (5.6%)
Show abstract

ObjectivesGrowth Mindset and Grit have been proposed as key psychological resources for resilience and adaptation, yet their manifestation and social distribution in later life remain underexplored. This study examines the structure, distribution, and correlates of Growth Mindset and Grit in older adulthood using proxy indicators in the English Longitudinal Study of Ageing (ELSA). MethodsProxy indicators reflecting learning behaviour, personality traits, affect, and beliefs were used to derive ...

5
Revised estimates of the types and durations of long Covid symptoms based on claims records from 245 Million US patients
2026-02-18 epidemiology 10.64898/2026.02.17.26346448
Top 0.2% (5.5%)
Show abstract

COVID-19 has been shown to cause a range of harmful long-term effects on nearly every organ system1-3. These findings are based on retrospective studies comparing COVID-19 patients to patients with similar medical histories and demographics but no COVID-19 diagnosis4-16. However, concerns have emerged that these comparisons may be biased if COVID-19 patients had unrelated health conditions or other factors not recorded in their medical records17-21. Here, using a massive dataset of 14.4 billion ...

6
Novel Representations of Vaccine Protection Against Progression to Severe Disease Over Time
2026-02-14 epidemiology 10.64898/2026.02.12.26346197
Top 0.2% (5.5%)
Show abstract

BackgroundVaccines can prevent severe disease by preventing infection or by reducing progression among those who become infected. Vaccine effectiveness against progression given infection is often used to quantify this second mechanism, but it conditions on infection, which is itself affected by vaccination. As a result, this estimand lacks a clear causal interpretation and may behave non-intuitively over time. MethodsWe introduce a conceptual framework that models protection against infection ...

7
Methodological Guidance for Predictor Variable Selection for Adolescent Smoking Outcomes in Global Youth Tobacco Survey Using R and Python
2026-02-17 epidemiology 10.64898/2026.02.14.26346305
Top 0.2% (5.5%)
Show abstract

BackgroundThe Global Youth Tobacco Survey (GYTS) is widely used to monitor tobacco use among adolescents worldwide. However, inconsistent analytical approaches particularly in handling complex survey designs and predictor selection limit comparability across countries, survey waves, and software platforms. Although much of the GYTS literature relies on proprietary tools such as SAS and SPSS, practical and transparent guidance on implementing reproducible, theory-informed analyses remains limited...

8
An E-value-Informed Sensitivity Analysis Framework for Hybrid Controlled Trials
2026-03-06 epidemiology 10.64898/2026.03.05.26347653
Top 0.3% (5.1%)
Show abstract

Hybrid controlled trials (HCTs) incorporate real-world data into randomized controlled trials (RCTs) by augmenting the internal control arm with patients receiving the same treatment in routine care. Beyond increasing power, HCTs may improve recruitment by supporting unequal randomization ratios that increase patient access to experimental treatments. However, HCT validity is threatened by bias from unmeasured confounding due to lack of randomization of external controls, leading to outcome non-...

9
Integrating stakeholder perspectives in modeling routine data for therapeutic decision-making
2026-02-18 epidemiology 10.64898/2026.02.18.26346074
Top 0.3% (4.9%)
Show abstract

BackgroundRoutinely collected health data are increasingly used to generate real-world evidence for therapeutic decision-making. Yet, stakeholders, including clinicians, pharmaceutical industry representatives, patient advocacy groups, and statisticians, prioritize different aspects of data quality, analysis, and interpretation. Without explicit consideration of these perspectives, analyses risk being fragmented, misaligned with end-user needs, or lacking transparency. MethodsWe developed a sta...

10
Exploring the exposome and unexplained variance in biological ageing - insights from a longitudinal twin study in adolescence and early adulthood
2026-03-04 epidemiology 10.64898/2026.03.03.26347499
Top 0.6% (3.9%)
Show abstract

Biological ageing begins before birth, with early-life exposures shaping late-life health. These exposures drive health inequities early, yet specific exposures and the composition of the ageing exposome remain largely undefined. This gap may persist as the field lacks agnostic investigations accounting for non-linearity, interactions and subtle signals. We aimed to identify exposures predictive of epigenetic ageing accumulated during childhood and adolescence and explore the composition of the...

11
Aging Out of the Blue: Estimating and Calibrating Region-specific Epigenetic Clocks for a Blue Zone via SuperLearner
2026-03-03 epidemiology 10.64898/2026.03.02.26346901
Top 0.6% (3.9%)
Show abstract

Epigenetic clocks estimate biological age from DNA methylation patterns at CpG sites, providing robust predictions of mortality and morbidity risk. "Blue zones"--regions of exceptional longevity--offer a unique opportunity to investigate how biological aging diverges from chronological age. However, standard clocks are typically trained on large, heterogeneous datasets, reflecting average population trends rather than region-specific dynamics. Using data from the Costa Rican Longevity and Health...

12
Longitudinal clustering of health behaviours and their association with multimorbidity: Evidence from Understanding Society (UKHLS)
2026-02-17 epidemiology 10.64898/2026.02.13.26346295
Top 0.6% (3.9%)
Show abstract

BackgroundSmoking, unhealthy nutrition, alcohol consumption, and physical inactivity (SNAP behaviours) are major risk factors for multimorbidity but are often studied in isolation. Using longitudinal data, Suhag et al. identified clusters of older adults (aged [≥]50) with common SNAP behaviour patterns and distinct sociodemographic profiles and multimorbidity prevalence; whether and how these patterns generalise across adulthood remains unclear. AimTo conceptually replicate Suhag et al. acro...

13
The Effect Of Smokers Transitioning To E-Cigarettes On Physical And Mental Health: An Emulated Trial Using Longitudinal Data.
2026-02-22 epidemiology 10.64898/2026.02.12.26345898
Top 0.8% (3.7%)
Show abstract

IntroductionTobacco smoking remains a leading cause of preventable death in the UK. Although e-cigarettes are promoted as a harm-reduction option, longitudinal evidence on short-term health outcomes across different smoking transition pathways is limited. This study examined short-term associations between transitions to exclusive e-cigarette use, dual use, or cessation and physical health, mental health, and health-related quality of life, compared with continued smoking. MethodsA target trial...

14
Early Population-Level Impact of Helicobacter pylori Eradication on Gastric Cancer Deaths in Japan: A Counterfactual Analysis of Short-Term Divergence
2026-02-26 epidemiology 10.64898/2026.02.24.26346975
Top 0.8% (3.7%)
Show abstract

BackgroundHelicobacter pylori infection accounts for 98% of gastric cancer (GC) cases in Japan. Since 2013, the nationwide expansion of H. pylori eradication therapy to chronic gastritis patients has created a unique opportunity to evaluate its population-level impact on GC primary prevention. However, short-term reductions in GC deaths are difficult to interpret given the long natural history of gastric carcinogenesis. This study aimed to assess the early impact of population-level eradication ...

15
Spatial Clustering of School Susceptibles Drives Divergent US Measles Outbreaks
2026-02-27 epidemiology 10.64898/2026.02.25.26347103
Top 0.9% (3.6%)
Show abstract

The two largest US measles outbreaks in over two decades (2025 Gaines County, Texas: 414 cases, contained; 2025-2026 Spartanburg County, South Carolina: 923+ cases, ongoing) occurred in counties with similar sub-threshold K-12 MMR coverage (85.1% vs 88.8%), yet their trajectories diverged dramatically. Using kernel density estimation with a common bandwidth and bootstrap uncertainty quantification, we compared sub-county vaccination data at the district level for Texas (3 districts, 3,560 studen...

16
Early life blood pressure, cognitive function and brain aging in mid-to-late life: A synthetic longitudinal cohort analysis
2026-02-26 epidemiology 10.64898/2026.02.24.26346790
Top 1.0% (3.6%)
Show abstract

PURPOSEOver 6.9 million Americans above the age of 65 are living with Alzheimers Disease (AD) or related dementias (ADRDs), which are diseases characterized by cognitive decline and structural brain changes associated with accelerated brain aging. Cardiovascular risk factors, in particular hypertension, are well-studied risk factors for AD/ARD. Evidence suggests that the effects of hypertension on cognitive aging may vary by life stage, yet prior studies have focused on the effects of mid- or la...

17
Assessing the risk of early-onset dementia within 5 years of cancer diagnosis
2026-02-15 epidemiology 10.64898/2026.02.12.26346204
Top 1% (3.5%)
Show abstract

ObjectiveTo evaluate risk of early-onset dementia (EOD) after diagnosis of cancer among Medicaid beneficiaries. DesignLongitudinal observational study of Medicaid enrollment, inpatient, and outpatient claims data from 26 states and Washington, DC, 2001-2019. MethodsBeneficiaries aged 18-64 with [≥]6 months of enrollment were matched 1:1 on cancer status (lung, colon, breast, prostate) by age, sex, race, year and state. We estimated the weighted cumulative incidence functions of EOD at 1, 2,...

18
Characterizing the impact of the COVID-19 pandemic on HIV testing among Medicaid beneficiaries
2026-02-14 epidemiology 10.64898/2026.02.12.26346199
Top 1% (3.0%)
Show abstract

ObjectivesEstimate the HIV testing, diagnoses, and test positivity rates among Medicaid beneficiaries in 2016-2021 and assess the impact of the COVID-19 pandemic on these outcomes. DesignProspective observational study of Medicaid enrollment, inpatient, and outpatient claims data from 27 states, 2016-2021. MethodsWe assessed Medicaid claims from adult beneficiaries with full benefits whose first continuous enrollment was [≥]6 months without dual enrollment in other insurance, and without pr...

19
Comparison of methods for assessing effects of risk factors on disease progression in Mendelian randomization under index event bias
2026-03-02 epidemiology 10.64898/2026.02.26.26347193
Top 1% (2.8%)
Show abstract

Mendelian randomization has emerged as a transformative approach for inferring causal relationships between risk factors and disease outcomes. However, applying Mendelian randomization to disease progression - a critical step in validating pharmacological targets - is hampered by index event bias. This form of selection bias occurs because analyses of disease progression are necessarily restricted to individuals who have already experienced the disease event. Here, we present a comprehensive eva...

20
COVID-19 hospitalizations in the Netherlands, 2023-2024: disease burden and vaccine effectiveness
2026-02-16 epidemiology 10.64898/2026.02.12.26346177
Top 1% (2.7%)
Show abstract

Since the cessation of real-time monitoring of COVID-19 hospitalizations in early 2024, the burden of and vaccine effectiveness (VE) against severe COVID-19 in the Netherlands was largely unknown. Recently, hospitalization data from 2024 were made available for the purpose of monitoring and evaluating the COVID-19 vaccination campaigns. These data were linked to the population registry, vaccination registry and healthcare use data (for classification into medical risk groups). We analyzed the n...